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■ Abstract. Helix-coil transition in polypeptides is an example of a spin model with 
£S) , a preferred spin direction, in the sense that a theoretical formulation of this problem 

requires to assign a preferred value of spin to the helical conformation in order to 
. account for different symmetries of the helical vs. the coil states. This leads to the 

spin Hamiltonian of the Generalized Model of Polypeptide Chain (GMPC) variety as 
opposed to the Potts model variety, both with many-body interactions. We compare 
the explicit solution of the Potts model and the solution of the GMPC within the 
transfer-matrix formalism. Comparison of both secular equations reveals that the 
largest eigenvalue of the Potts model with A many-body interactions is identical to the 
largest eigenvalue of the GMPC model with A — 1 many-body interactions, indicating 
the equivalence of both free energies. In distinction, the second largest eigenvalues do 
not coincide, leading to different thermal behavior of the spatial correlation length, 
related to the helix-coil transition interval. Spin models with built-in spin anisotropy 
thus engender different physical properties in the thermodynamic limit that we explore 
in detail. 
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1. Introduction 

Effective one- dimensional spin models have been widely applied to the description of 
thermodynamic properties of hard condensed matter [H[2]- However, these models are 
in general not very useful since they do not result in a phase transition, known to exist 
in such systems. Nevertheless, the description of systems like quantum dots, quantum 
wires and then all the way to different soft matter systems, now opens up a possibility to 
revitalize the importance of one-dimensional spin models for the description of systems 
with low dimensionality, where one can envision the ordered state as being realized 
only when spins have a preferred direction. The application of Potts-like many valued 
spin models allows to distinguish this preferred direction by assigning it a selected spin 
value. Assuming only nearest neighbor interactions, one can construct a Hamiltonian 
preferring spin states where nearest neighbor spins are in the same, preferred orientation. 
This model would be different from the classical Potts model and it is not clear, to 
what extent the introduction of preferred spin value influences the physical properties. 
There are several soft matter systems where this kind of considerations would make 
sense, e.g. a polypeptide undergoing helix-coil transition [3j [10] and the stretch-induced 
transformation from the standard B-DNA conformation to a more extended S-DNA 
form [El[9]. 

Specifically, for the helix-coil transition the description of polypeptide conforma- 
tions can be reduced to consideration of a pair of torsional angles, related to each of 
the peptide units [5]. A two-dimensional plot (Ramachandran's plot) of accessible vs. 
not accessible regions of these variables shows, that helix formation is promoted only 
when both torsional angles assume values from a well-defined a-helical region of values. 
When modeled in terms of spins, this furthermore implies that the helix can be formed 
only when spins take on a preferred value. This is exactly the situation we alluded 
to above in the sense that there exists a preferred orientation of spin. The situation 
with stretch-induced transformation from B-DNA to a more extended S-DNA is in fact 
similar since the formalism of the description is based on models of the helix-coil type 

A preferred direction of the spin is not the only feature differentiating between 
different models. There is also the range of interactions that one needs to consider. 
Specifically, while in the case of a polypeptide chain it is essential that three successive 
spins all be in a chosen conformation corresponding to a single helix-inducing hydrogen 
bond [3], the description of DNA over-stretching implies that up to ten successive spins 
be engaged in a double helix-engendering hydrogen bond between opposing strands [31J. 
It thus transpires from both examples that it is necessary to consider some finite range 
of interaction and thus a finite number of nearest spins, A, as crucial for the local 
formation of an ordered state - a hydrogen bond in the considered case. 

There are thus two distinguishing features of the one-dimensional spin models 
worthy of further consideration. One is the number of nearest spins, entering into 
the local formation of an ordered state, and the other one is the existence of a preferred 
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Figure 1. (COLOR ONLINE) A polypeptide chain in a trans conformation. 
Parallelograms indicate the plane of a virtual peptide bond, a) Schematic view of 
a polypeptide chain where the main-chain atoms are represented as rigid peptide 
segments, linked by virtual bonds through the C a atoms. Each segment has two 
degrees of freedom due to the rotation around the C a — C (torsional angle <j>) and 
N — C a (torsional angle ip) bonds. R stands for the amino acid residues, while all 
other atoms have the corresponding chemical labels, b) Coarse-grained representation 
of a polypeptide chain: the conformation of (i)-th repeating unit is described with the 
help of bond length U , bond angle 6i and a pair of torsional angles 4>i j V 7 * ■ 



spin orientation pertinent to the ordered state. The effects of the former can be 
analyzed within the many-valued (Potts) spin model with an arbitrary but finite range 
of many-body interactions, while the latter forms the basis of The Generalized Model 
of Polypeptide Chain (GMPC), accounting for the preferred spin orientation. GMPC 
has been formulated several decades ago [151 EH E] an d has been extensively studied 
specifically in the context of the helix-coil transition [TB~1 [20], [23]. It was shown that the 
Zimm-Bragg model [12] and the Lifson-Roig model [2] both correspond to particular 
cases of the GMPC variety with A = 2 and A = 3, respectively [T71 [18]. The Wako- 
Saito-Munoz-Eaton (WSME) model, widely applied to protein folding (see [321 1331 131] ) . 
can also be shown to be related to the GMPC model (we will further comment on this 
point in the Conclusions). There is thus a spectrum of models that fall within the same 
class as the GMPC model. On the other hand, if no preferred spin value is taken into 
consideration, the standard Potts model with nearest-neighbor interactions (A = 2) 
[U [2] can be applied to a helix-coil transition, as shown by Goldstein [19]. However, 
for A > 2 there do not seem to exist any known solutions of the one-dimensional Potts 
model. We thus embark on a detailed study and comparison of the solutions of the two 
models for different values of A, in order to connect the Potts and the GMPC model 
with many-body interactions with the models that do not allow for a spin Hamiltonian 
description but have been traditionally used to describe the statistical characteristics of 
the helix-coil transition [131 El EE] • 

2. Helix-coil model formulation in terms of spin variables 

Statistical description of polypeptide chain conformations involves important coarse- 
graining on the level of the C a atoms, because of the planar configurations of the atomic 
groups (C^, Cj_i, Oj_i, Nj) due to specific bond hybridizations. The planar structure 
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of these groups (peptide groups) allows the introduction of virtual bonds, connecting 
the neighboring asymmetric carbon atoms (Fig. (Ha)) [31 H]. The configuration of a 
polypeptide chain can then be described with the sequence of (virtual) bond vectors 
[HI HE] j related to its backbone. In this description the bond lengths {7j}, i = 1...N — 1, 
bond (valence) angles {Oi}, i = 1...N — 2, and pairs of {4>i,ipi} torsional angles can 
be associated with each repeating unit. This description can be further simplified by 
taking into account that bond lengths and angle values usually vary within very narrow 
intervals (average fluctuation of ±3 -j- 5% at room temperature) and their fluctuations 
can be ignored [H]. The only relevant variables remaining are thus the {4>i,ipi} torsional 
angles (Fig. H^b)). 

The conformational partition function of the repeating unit can be represented as 
a finite sum after discretization of the torsional angles (approximation of rotational 
isomers) [6], opening up a possibility for a spin-based description of the polypeptide 
conformations. Assume that spin ^ describing the conformation of the zth repeating 
unit can take one of the Q(> 2) values; 7, = 1 corresponding to values of the torsional 
angles {(pi, tpi} from the helical region of the Ramachandran map, while the other Q — 1 
values correspond to torsional angles from allowed (not helical) region. The magnitude 
of Q (number of spin orientations) can be identified with the ratio of the allowed region 
area versus helical region area on a Ramachandran map. According to the polypeptide 
chain geometry the equilibrium hydrogen bond formation can be established between 
the NH and CO groups, separated by three asymmetric carbon atoms [3]; the energy 
U is associated with every formed hydrogen bond corresponding to a coupling constant 
W = exp (U/T), where T is the temperature. One hydrogen bond thus restricts three 
{</>, ip} pairs of rotation angles and establishes the structure with screw symmetry (a- 
helix) [30| . Within the spin language this means that the hydrogen bond fixes three 
successive spin values along the chain. On the other hand, hydrogen bonds in double 
stranded DNA are formed between repeating units on the opposite strands and are 
approximately perpendicular to the DNA axis. Creation of hydrogen bonds in one pair 
of opposing bases thus applies restrictions to conformational states of ~ 10 neighbors (on 
the scale of single-strand Kuhn length) [31]. It makes sense to generalize and consider 
that one hydrogen bond formation restricts arbitrary (but finite) A number of spins 
[T7] , corresponding to many-body interactions. As helix formation comes at an entropic 
cost [11], the larger is A, the higher is such an entropic cost [T7] . The transformation 
from a coil to a helical conformation is energetically favorable (negative hydrogen bond 
energy is gained) but entropically unfavorable (the number of micro-states, available for 
repeating unit in a helical macro-state is decreased, as compared to the coil). As we 
show below, the compensation of energetic and entropic costs engenders a transition at 
the temperature corresponding to exp(U/T) = Q. 

To summarize, statistical description of the helix-coil transition requires three basic 
parameters: an energetic parameter, W = exp(U/T), where U is the energy of a 
hydrogen bond; an entropic parameter, Q, that stands for the number of spin values; 
and a geometric parameter, A, that describes the many-body geometry of the hydrogen 
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Figure 2. (COLOR ONLINE) Schematic representation of a 10-mer with spins 
assigned to each repeating unit. Spins are shown as red arrows with Q — 8 possible 
orientations. (Above) Assigned energy (U) in the nearest-neighbor (A = 2) GMPC 
model (with red dotted line indicating the preferred orientation of the spin). (Below) 
Potts model as in Ref. [19] . The Potts energy assignment results in a higher energy of 
the sample spin sequence. 



bond formation. The corresponding Hamiltonian can thus be built in terms of the 7j 
spins [151 [16j [T7J [HJ EOJ [23] and corresponds to the GMPC model if the proper helix 
formation demands that A successive 7's are all in the same preferred conformation 
number, e.g. 1 (see Fig. [2] top). In the case of no preferred spin assignment to the helix 
formation we are then back to the Potts model (see Fig. [2] bottom). In what follows 
we consider both types of spin Hamiltonians and discuss similarities and differences 
between the ensuing thermodynamics. 



3. Generalized model of polypeptide chain in transfer-matrix formalism 

3.1. Hamiltonian 

The Hamiltonian for the Generalized Model of Polypeptide Chain (GMPC) is defined 

as 

N 
i=l 

where J = U/T is the reduced energy of the hydrogen bond. The strength of the 
hydrogen bond is between the valence bond and the van der Waals interactions. By 
definition the energy of hydrogen bond formation is negative. 5(a, b) is the Kronecker 
symbol. When generalized to any finite A, the above Hamiltonian assumes the form 

NO N 

-(3H=jJ2 J] 5( lt . k ,l) = J^ A \ (2) 

i=l fc=A-l i=l 



where 5\ is the product of A Kronecker symbols for neighboring repeating units. 
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3.2. Transfer Matrix Approach and characteristic equation 

A transfer-matrix can be constructed corresponding to the Hamiltonian Eq. |2]for A = 2, 
A = 3 and A = 4 cases and in fact any finite 1 < Q < oo. The algorithm for larger 
values of A is similar to that for the A = 3 and A = 4 cases. 

Starting at A = 2 it is straightforward to show that the Q x Q transfer-matrix 
reads 



and contains many (Q — 1) identical rows and columns. At A = 3 the situation is more 
complicated, since many-body interactions began to play a role, and straightforward 
construction of the transfer-matrix is impossible. However, there is an efficient trick for 
the transfer-matrix construction. Instead of the three spin variables 7t-2, 7i-i, 7i, one 
introduces a pair of two-index variables fli-i = (7 4 '_i,7i) and = (ji-i, ji) and sets to 
zero all elements of the matrix (Q 2 x Q 2 ) for which 7^ 7^ 7j_i [5J. In other words, we 
write the Hamiltonian in the form 
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with the statistical weight 
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Then the (Q 2 x Q 2 ) transfer matrix reads 
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Here again, there are many (Q 2 — A) identical rows and columns 



(7) 



At A = 4 there are 



4 spin variables and we add another 2, following a similar trick as above, so that a pair 
of three-index variables reads = (7^-3, 7j'_ 2 , Yi-i) an d ^ = (li-2, 7i-i> id- The 
statistical weight is then prescribed as 

fKJVA) = e-^(7,_ 2 ,7U)^(Ti-i,7--i)- (8) 

The resulting transfer matrix has dimensions (Q 3 x Q 3 ) and is by its structure similar to 
Eq. [7J For larger As it is necessary to group 7s into fij-i and Qi in a similar way. The 
procedure can be generalized and the proper statistical weight would then be written as 



-H/T 



n*(?, 



(9) 



fe=A-2 



resulting in a transfer matrix G(A) of dimensions (Q A_1 x Q A_1 ). Since there are 
(Q A_1 — A) identical rows and columns, the characteristic equation for G(A) turns out 
to be quite simple 

A 

Pgmpc{\ W, Q, A) = A« A ~ 1 - A {A A -(W/-1+Q)A A - 1 +(W/-1)(Q-1) ^ \ A ~ k } = 0.(10) 



k=2 



It is obvious, that there are A non-trivial eigenvalues, so that to construct the 
thermodynamics of the model, it is enough to consider a transfer matrix of a much 
smaller, (A x A) size. Such a matrix has been derived in [15] by doing elementary 
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transformation over and looks like 

/ W-l W - 1 
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One can construct this transfer-matrix in the following way: 

• All elements of first row are equal to W — 1 = e J — 1; 

• All elements of the first lower pseudo-diagonal are 1; 

• The element (A, A) is Q; 

• All other elements are zero. 

Alternatively, elementary transformations can lead to 



/ W 1 
1 



g*(A) 



\ 1 






1 1 













\ 



1 

Q-l 

1 l Q-l J 



(11) 



(12) 



Both g(A) and g* (A) have much smaller size than G(A) and result in the same 
characteristic equation 

A 

Pgmpc(X, W, Q, A) = A A - (W - 1 + Q)X A -' + {W-1)(Q-1)J2 ^ = °- ( i3 ) 

k=2 

By adding artificial A = 1 root, the characteristic equation can be written in much more 
compact form 

Pgmpc(\ W, Q, A) = X A -\X -W)(X-Q)-(W- 1)(Q - 1) = 0. (14) 



4. One dimensional Potts model with many-body interactions in 
transfer-matrix formalism 

4-1. Hamiltonian 

Following Goldstein's formulation [19J, viz. without any distinction between spin values, 
we construct the Hamiltonian for a Potts model with A many-body interactions as 

N 1 

-[3H = jY, II <*(7*-*,7i-*-i)- (15) 

i=l fe=A-l 

One can notice, that for the same A many-body interactions the Hamiltonian of Potts 
model contains the product of A — 1 Kronecker symbols instead of exactly A such 
symbols as in the case of the GMPC model. This fact has important consequences, as 
we will show below. 
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4-2. Transfer Matrix Approach and characteristic equation 



The transfer-matrix corresponding to the Hamiltonian Eq. [15] can be constructed 
seriatim for A = 2, A = 3, A = 4 and then for any finite 2 < Q < oo. The algorithm 
for larger values of A is similar to that for the A = 3 and A = 4 cases. 

At A = 2 it is straightforward to show that the Q x Q transfer-matrix reads 
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At A = 3 we use the same trick used above for construction of transfer matrix of 
the GMPC model. Instead of the three spin variables 7i_2, 7t-i, 7i, we now introduce 
a pair of two-index variables f2j_i = (7t-2,7i_i) and Qi = (7t-i, 7t) and set to zero all 
elements of the matrix (Q 2 x Q 2 ) for which 7^ ^ ^i-i [5]. This results in transfer 
matrix 
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The statistical 



At A = 4 there are 4 spin variables and we add another 2, as above, 
weight is prescribed according to 

The resulting transfer matrix has dimensions (Q 3 x Q 3 ) and is by its structure similar 
to Eq. [T71 For larger As it is necessary to group 7s into fli-i and Qi accordingly. The 
procedure can be generalized and the statistical weight would be written as 

1 



g(Sk-un i ) = e- H ' T J] 



(19) 



fc=A-2 
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resulting in a transfer matrix G(A) Potts of dimensions (Q A_1 x Q A_1 ). This matrix 
differs from the GMPC case (see Eq. [7j) in that all the diagonal elements are multiplied 
by e J , while in Eq. [7] only the element (1, 1) is multiplied by this factor. Here again, 
there are a(Q, A) = Q A_1 — Q(A — 1) identical rows and columns. The corresponding 
characteristic equation then follows as 

Ppotts(X,W,Q,A) = (20) 

X a(Q,A) j A A-1 _ ( W _ 1 + Q) A A-2 + _ -Q (Q _ -Q A* -1- * J X (21) 



A-1 \ Q_1 

A A_1 — (W — 1)A A - 2 - (W - 1) xA ^ k ) = °- 1 22 j 

fc=2 / 

Elimination of trivial eigenvalues results in 

p P otts(X,W,Q,A) = (23) 

' A-1 \ 

A A -i _ (w - 1 + Q)A A ~ 2 + - 1)(Q - 1) AA ~ 1_fe x ( 24 ) 



A:=2 



A-1 



k=2 

'-1 



A A_1 — (W — 1)A A_2 — (W — 1) ^ A A-1 ~ fc J =0. (25) 

k=2 J 

Unfortunately, it is not possible to derive a simpler transfer-matrix, that would 
correspond to such characteristic equation. We have checked Eq. [23] to be true up 
to A = 7 by hand and using Wolfram Mathematica software. 

5. Comparison of characteristic equations for the GMPC and Potts models 

Since the transfer-matrix, being a matrix of statistical weights, is non-negative, 
Frobenius-Perron theorem applies, and there exists a positive, non-degenerate maximal 
eigenvalue Ax- After solving the characteristic equation and assuming cyclic boundary 
conditions, we can straightforwardly reconstruct the partition function as 

A 

Z(A)= hm J>f = Af, (26) 

i=l 

with the free energy as 

F (A) = -TN In Ai, (27) 
and the spatial correlation length as 

£(A) = In" 1 (¥) , (28) 



\A 2 y 

where Ai is the maximal and A2 is the second largest eigenvalue. This means that the 
thermodynamics of the model is determined by its characteristic equation. 




Figure 3. COLOR ONLINE) (a) Three largest eigenvalues (top) from Eq. OH vs 
W = exp[U/T] for A = 3 and Q — 60. Correlation lengths for A particle Potts and 
A - 1 particle GMPC models (bottom) (b) A = 4. (c) A = 5. 



The comparison of Eq. [13] with Eq. [23] reveals similarities and differences between 
the two considered models, as obviously 

PPotts{\ W, Q, A) = p GM pc{\ W,Q,A — 1) x p GM pc{\ W, Q = 0, A - 1) Q ~\ (29) 

and therefore the properties of the Potts model defined by pp D tts(A, W, Q, A) are related 
to the properties of the GMPC model of Pgmpc{\ W, Q, A — 1). In the region of positive 
temperatures (W > 1) the first bracket of Eq. [29]has two positive roots, while the second 
bracket has a single, positive and Q — 1 times degenerate root. 

Detailed analysis reveals that the root from the second bracket is always in between 
the two roots from the first bracket (see Fig. [3]), so that the maximal root (which 
determines the free energy) always originates from the first bracket. In other words, the 
one-dimensional Potts model with A many-body interactions has the same free energy 
as A — 1-many-body GMPC model. All averages that depend on the largest eigenvalue 
only, such as helicity degree or number of junctions [T6l [T7] . are therefore the same for 
both models. 

However, correlation lengths are obviously different, as is illustrated on Fig. [3] 
This follows since the correlation length for the Potts model is determined via Ai and 
A2, while for the GMPC model via Ai and A3. Indeed, as one can see on Fig. [3] while the 
correlation lengths of both models coincide at low W (high temperatures), they differ 
essentially at intermediate to high values of W (low temperatures). The correlation 
length of the Potts model abruptly increases close to the temperature, where energetic 
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and entropic parameters compensate each other (W = Q), while the correlation length 
of the GMPC model goes through a maximum at this point. The appearance of large 
correlations in the Potts model is a direct consequence of the absence of a preferred spin 
value, meaning that the standard Potts model is not suitable to describe the systems 
with a special, preferred direction of the spin, conditioned by an external field or by the 
structure of the system, as is the case in biopolymers. 

6. Conclusion 

The one- dimensional nature of the considered models allowed us to explicitly construct 
the transfer-matrix for A many-body Potts model Eq. [17] and to derive the corresponding 
secular equation Eq. [20] The three largest eigenvalues, most important for resulting 
thermodynamics, were considered for several finite As. As a result, it was found out 
that only the free energy (largest eigenvalue) of A many-body Potts model is equal to 
that of the A — 1 many-body GMPC model, while the correlation lengths (which depend 
on the second largest eigenvalue as well) of two models differ significantly. 

The analysis presented above shows that a Potts model with A = 2 [19], formulated 
on the level of the effective free energy, is equivalent to the one-body GMPC model 
(A = 1). Since the Zimm-Bragg model has been shown as originating from the A = 2 
GMPC model [18] , this means that in order to achieve at least the same level of 
description, the former approach should be extended to next-nearest neighbor, three- 
body, interactions. The characteristic equation of the Lifson-Roig model can be derived 
from GMPC model Hamiltonian with three-body interactions [16J, so that the Potts 
model with A = 4 would be necessary to get an equivalent free energy. 

It is worth noticing that the GMCP model is also related to another interesting 
model that has been frequently used in the framework of protein folding, namely 
the Wako-Saito-Munoz-Eaton (WSME) model [321 M> EI] . Unlike the GMPC model, 
and quite similarly to the Zimm-Bragg model, the WSME model sets out from a 
phenomenological expression of the free energy. The methodology to pass from the 
GMPC Hamiltonian model to the corresponding free energy has already been elucidated 
in Ref. [18] . and it turns out that the resulting free energy bears strong similarities with 
the corresponding WSME one - being in fact equivalent for finite range interactions, 
apart from an appropriate rescaling in the parameters. Correlation lengths are, however, 
in general different for essentially the same reasons given in the present work for the 
Potts model. As this point appears to be interesting, it will be the subject of a future 
dedicated analysis. 

Though the derivation of the secular equation of the many-body Potts model is 
straightforward, it was not derived or analyzed before. Interesting differences in the 
thermodynamic behavior appear once one of the spin values is preferred, as in the GMPC 
model. In the case when the isotropy of spins is broken in this way, the application of 
the GMPC model as opposed to the Potts model seems to be more adequate, as shown 
explicitly for the example of the helix-coil transition. Since a fruitful analogy between 
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magnetic and polymer systems is well established and long known, we believe that there 
are likewise situations in the theory of magnetism, where application of the GMPC 
model instead of the standard Potts model would lead to a more detailed understanding 
of the thermodynamic properties. 
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